CHAPTER 1 OUTLINE


The µPD7755/7756 is a speech synthesis LSI that employs
the waveform coding method. By combining the ADPCM coding
method and the phoneme method, natural synthesized speech
has been realized. Also, the wide range of operating
voltages, compact package, standby function, etc., make
these LSIs ideal for various application systems such as
battery-driven systems.


1.1 Features 


Synthesizing method      :Combined ADPCM and phoneme methods
Sampling frequency       :4, 5, 6 or 8kHz
Bit rate (speech)        :8 to 32kbps
Built-in speech data ROM :96K bits (µPD7755)
                          256K bits (µPD7756)
Synthesizing time        :12 sec. max. (µPD7755)
                          32 sec. max. (µPD7756)
Built-in D/A converter   :9-bit resolution
                          current output
Built-in standby function:Popcorn noise suppressor
                          incorporated
Supply voltage           :2.7 to 5.5V
Process                  :CMOS
Package                  :18-pin plastic DIP


[Block diagram: message select inputs I0-I7, system controller, speech data memory, decoder, D/A converter (REF input, AVO output), and oscillator OSC (X1, X2)]

Fig. 1-1 System Configuration


CHAPTER 2 PIN FUNCTIONS 


2.1 Pin Configuration (Top view) 


     I4  o        o  I3
     I5  o        o  I2
     I6  o        o  I1
     I7  o        o  I0
    REF  o        o  ST
    AVO  o        o  CS
   BUSY  o        o  X1
  RESET  o        o  X2
    GND  o        o  VDD


2.2 Pin Functions 

(1) I0-I7 (Input)
    These eight pins together function as the message
    select input. Positive logic is used for these pins
    (high level=1). Each input is provided with an input
    latch that latches at the rising edge of the ST
    input. During standby mode, these pins should be set
    to either high or low level; if they are biased at
    or near the typical CMOS switching level, excess
    current will drain.

(2) CS (Input)
    This is the chip select input pin. When a low level
    signal is input to this pin, the ST input is enabled.


(3) ST (Input)
    This is the start signal input pin. When a low level
    signal is input to this pin while the CS signal is
    also low level, speech synthesis of the message
    stored in the speech ROM location addressed by the
    contents of I0-I7 begins.
    When the CS and ST signals are both set to low level,
    the standby mode is released.

(4) BUSY (Output)
    This is an active-low pin that outputs the BUSY
    signal. A low level signal is output from this pin
    when the start signal is accepted; once this
    output becomes low level, the start input will not
    be accepted. The output from this pin becomes high
    impedance in standby mode.

(5) REF (Input)
    This pin inputs the reference current for the D/A
    converter. The output current of the D/A converter is
    controlled by the current input to this pin. This pin
    should be connected to VDD via a resistor. In standby
    mode, this pin becomes high impedance.

(6) AVO (Output)
    This pin outputs the analog speech signal, which is
    a unipolar sink current.

(7) RESET (Input)
    This is the reset signal input pin and is used to
    initialize the LSI at power up, abort speech
    synthesis, and release the standby mode. To reset the
    chip, this signal must be held low for at least 12
    oscillator clocks. When the standby mode is released,
    at least 12 clocks must be input after clock
    oscillation start-up completes.

(8) X1, X2
    These are the clock pins and are connected to a
    ceramic resonator (640kHz). During standby mode, X1
    becomes low level and X2 becomes high level.




(9) VDD
This pin should be connected to the power supply. 


(10) GND 
This pin should be connected to ground. 


CHAPTER 3 OPERATION 


3.1 Sample Frequency 
The relation between synthesized speech message length
and bit rate when the ADPCM method is used is shown in
Table 3-1.


Table 3-1 Sampling Frequency and Maximum Message Length


Sampling frequency


For normal speech with no background music, about 10% to
20% of the message is made up of silent frames. These
frames are compressed in the case of the µPD7755/7756, so
the actual bit rate is about 80% to 90% of that shown in
Table 3-1.
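
As a rough cross-check of these figures, the sketch below relates ROM capacity, bit rate, and message length. It assumes that 1K means 1000 bits and that silent frames consume no ROM at all; both are simplifications, and the function names are illustrative rather than taken from any NEC tool.

```python
# Rough estimate only. Assumptions: 1K = 1000 bits, and silent
# frames cost no ROM (the text only says "about" 80% to 90%).
ROM_BITS = {"uPD7755": 96_000, "uPD7756": 256_000}


def max_message_seconds(rom_bits, bit_rate_bps):
    """Longest message that fits in ROM at a constant bit rate."""
    return rom_bits / bit_rate_bps


def effective_bit_rate(nominal_bps, silent_fraction):
    """Bit rate after silent frames (10% to 20% of speech) are compressed."""
    return nominal_bps * (1.0 - silent_fraction)


# At 8kbps the two ROM sizes give the 12 s and 32 s maximum
# synthesizing times listed in section 1.1.
print(max_message_seconds(ROM_BITS["uPD7755"], 8_000))
print(max_message_seconds(ROM_BITS["uPD7756"], 8_000))
print(effective_bit_rate(32_000, 0.2))
```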

Even further reductions in the bit rate have been achieved
by combining the ADPCM and phoneme methods. The bit rates
for speech synthesis when this combined method is used
are shown in Table 3-2.


Table 3-2 Sampling Frequency and Maximum Message Length
(with data compression)


Sampling frequency (kHz)


3.2 D/A Converter 
The built-in D/A converter is a unipolar, current output
type with a 9-bit resolution. The digital input to the
converter is converted to a 9-bit offset binary code.
This code has a range of 0 to 1FFH with a midrange value
of 100H.
The schematic drawing of the D/A converter is shown in
Fig. 3-1. As can be seen, a constant-current source is
connected in parallel for each bit of the digital input.
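
The offset binary coding described above can be illustrated with a short sketch. It assumes the decoder produces a signed sample in the range -256 to +255, which is the usual convention for 9-bit offset binary but is not stated explicitly in the text; the function names are hypothetical.

```python
MIDSCALE = 0x100    # midrange value (AC zero level)
FULL_SCALE = 0x1FF  # largest 9-bit code


def to_offset_binary(sample):
    """Map a signed sample (-256..255, assumed) to a 9-bit D/A code."""
    if not -256 <= sample <= 255:
        raise ValueError("sample does not fit in 9 bits")
    return sample + MIDSCALE


def from_offset_binary(code):
    """Inverse mapping: 9-bit D/A code back to a signed sample."""
    if not 0 <= code <= FULL_SCALE:
        raise ValueError("code is not a 9-bit value")
    return code - MIDSCALE


# Silence maps to the midrange code, so a bias current flows
# even when there is no AC signal.
print(hex(to_offset_binary(0)))
```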


[Schematic: µPD7755/7756 D/A converter with a constant-current source for each bit of the 9-bit digital input, summed at AVO]

IAVO: Output current
RL:   Load resistance
VO:   Potential across RL

Fig. 3-1 Schematic Diagram of D/A Converter


3.2.1 Output Current and Reference Resistance
The output current of the D/A converter can be controlled
by the voltage applied to the REF pin. This is realized
by the resistor connected across the VDD and REF pins as
shown in Fig. 3-2.
The relations between the voltage at the REF pin, VREF,
and the input reference current IREF or the D/A converter
output current IAVO are shown in Figs. 3-3 and 3-4.
If the impedance of an amplifier to be connected to the
AVO pin is as high as several kΩ or more, refer to Fig.
3-3 to determine the value of IREF. In Fig. 3-3, the
range of IREF is 60 (at the intersection of the load
line, RREF, and the MAX. curve) to 80µA (at the
intersection of RREF and MIN.) when RREF is 50kΩ and VDD
is 5V. Under this condition, the range of IAVO is 1.9 to
2.9mA because IAVO is 32 to 36 times IREF. Also, if the
impedance is high, fluctuations in the output level of
the synthesized speech are about 4dB, which is not so high.
Therefore, a highly precise fixed resistor can be used as
RREF instead of a variable resistor, except on occasions
when the synthesized speech output level must be
constant. In contrast, if the impedance of the amplifier
connected to the AVO pin is low, determine RREF by
referring to Fig. 3-4. In this figure, the range of
IAVO at a VDD of 5V and an RREF of 1kΩ is 12 to 30mA. If
the impedance is low, fluctuations in output levels of
the synthesized speech increase to about 8dB.
Therefore, using a variable resistor as RREF is
recommended.
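
The figures quoted above can be reproduced with a few lines of arithmetic. This is only a sketch: it takes the 32x to 36x current ratio and the IREF and IAVO ranges from the text, and it assumes that the "fluctuation" in dB is 20*log10 of the ratio between the largest and smallest output current.

```python
import math


def iavo_range_ma(iref_min_ua, iref_max_ua, gain_min=32, gain_max=36):
    """IAVO range in mA for an IREF range in µA (IAVO = 32 to 36 x IREF)."""
    return (iref_min_ua * gain_min / 1000.0, iref_max_ua * gain_max / 1000.0)


def spread_db(i_min, i_max):
    """Level spread between two currents in dB (assumed 20*log10 ratio)."""
    return 20.0 * math.log10(i_max / i_min)


low, high = iavo_range_ma(60, 80)       # RREF = 50kΩ, VDD = 5V
print(round(low, 2), round(high, 2))    # close to the quoted 1.9 to 2.9mA
print(round(spread_db(1.9, 2.9), 1))    # high-impedance case, about 4dB
print(round(spread_db(12, 30), 1))      # low-impedance case, about 8dB
```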


[Circuit: µPD7755/µPD7756 with connections to VDD and output current IAVO at AVO]

Fig. 3-2 D/A Converter Reference Current

[Graph: IAVO (mA) and IREF (µA) versus VREF (V)]

Fig. 3-3 Relation Between VREF and IREF/IAVO
(in Low-Current Area)

[Graph: IAVO (mA) and IREF (µA) versus VREF (V)]

Fig. 3-4 Relation Between VREF and IREF/IAVO
(in High-Current Area)


3.2.2 Setting Output Current
Because the output current from the AVO pin (IAVO) is a
sink-load current, the load resistance (RL) is connected
to VDD. Therefore, the potential at AVO must be within
the range in which the output transistor of the D/A
converter operates as a constant-current supply.
The operating voltage range is:

    When VDD=5V:  1.5V to VDD
    When VDD=3V:  1V to VDD

The load resistance and output current should be
determined so that the operating voltage applied to the
output transistor falls within this range.
To adjust output current IAVO, observe the following
procedure.

(1) Select the message to be synthesized (to allow the
    D/A converter to output the middle (bias) current).

(2) Input the RESET signal within 3 seconds after the
    synthesized speech has been output (to prevent the
    LSI from entering standby mode).

(3) Adjust RREF so that the voltage at the AVO pin
    falls within the rated value.
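
A quick way to sanity-check a choice of load resistance is to compute the AVO potential as VDD minus the drop across RL. The sketch below does this under the assumption of a purely resistive load; the 1kΩ value and the currents in the example are placeholders, not recommended settings.

```python
# Minimum AVO voltage for constant-current operation (section 3.2.2).
# Only these two supply points are given in the text.
V_AVO_MIN = {5.0: 1.5, 3.0: 1.0}


def avo_voltage(vdd, i_avo_ma, rl_ohm):
    """Potential at AVO when IAVO (mA) sinks through RL tied to VDD."""
    return vdd - (i_avo_ma / 1000.0) * rl_ohm


def within_operating_range(vdd, i_avo_ma, rl_ohm):
    """True if the AVO potential stays inside the stated range."""
    v = avo_voltage(vdd, i_avo_ma, rl_ohm)
    return V_AVO_MIN[vdd] <= v <= vdd


# Hypothetical example: RL = 1kΩ at VDD = 5V.
print(within_operating_range(5.0, 2.9, 1000))  # about 2.1V at AVO
print(within_operating_range(5.0, 4.0, 1000))  # about 1.0V, too low
```

Because the output swings above the bias current, the check should be made with the largest expected IAVO, not only the middle (bias) value.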


3.3 Standby Mode 

When the µPD7755/7756 is not performing speech synthesis,
it is set in standby mode. In this mode, current
consumption is reduced to less than 1µA (TYP.). The
condition of the µPD7755/7756 in standby mode is as
follows:

• The clock stops
• The BUSY pin becomes high impedance
• The REF input pin becomes high impedance
• X1 becomes low level and X2 becomes high level
• I0-I7, ST, CS, and RESET inputs are all enabled
• IAVO becomes 0


3.3.1 Entering standby mode 
The µPD7755/7756 enters the standby mode if the
following conditions exist for more than 3 seconds after
completion of speech synthesis:
(1) CS or ST is high level
(2) RESET is high level


3.3.2 Releasing standby mode 
Standby mode is released by the following procedure:
(1) Set CS to low level
(2) Set ST to low level
At this time, the message select code will be input
to I0-I7.
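
The entry and release conditions of sections 3.3.1 and 3.3.2 can be summarized as two predicates. This is a behavioral sketch for reasoning about host firmware, not a model of the internal logic; the function and parameter names are illustrative.

```python
STANDBY_DELAY_S = 3.0  # "more than 3 seconds" after synthesis completes


def enters_standby(idle_seconds, cs_high, st_high, reset_high):
    """Section 3.3.1: (CS or ST high) and RESET high for more than 3 s."""
    return idle_seconds > STANDBY_DELAY_S and (cs_high or st_high) and reset_high


def releases_standby(cs_high, st_high):
    """Section 3.3.2: standby is released when CS and ST are both low.

    RESET also releases standby (see the RESET pin description);
    that path is not modeled here.
    """
    return (not cs_high) and (not st_high)


print(enters_standby(3.5, cs_high=True, st_high=True, reset_high=True))
print(releases_standby(cs_high=False, st_high=False))
```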


3.3.3 Eliminating popcorn noise in standby mode
Because the µPD7755/7756 uses a unipolar 9-bit D/A
converter in operation mode, there is a bias current
output even when there is no signal input (AC signal).
In standby mode, the input value to the D/A converter
becomes 0, as does the output current.




Therefore, when the LSI moves from operating mode to
standby mode, popcorn noise may be generated by the
sudden change in the output of the D/A converter. To
prevent this, the output of the D/A converter is
gradually reduced prior to entering the standby mode (see
Fig. 3-5).

Also, when changing from standby to operating mode, the
D/A output is gradually raised to the AC 0 level once
clock oscillation, started in response to the standby
release signal generated by input of the CS and ST
signals, has completed start-up (see Fig. 3-6).

The transition time for the D/A output is approximately
46.5ms, which is enough to suppress this type of noise.
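
To give a feel for the transition, the sketch below converts the 46.5ms transition time into oscillator clocks at 640kHz and generates a ramp between the standby value (000H) and the midrange value (100H). The text only says the output changes "gradually"; the linear shape and the step count used here are assumptions made for illustration.

```python
F_OSC_HZ = 640_000   # ceramic resonator frequency
T_DA_S = 46.5e-3     # approximate D/A output transition time


def transition_clocks(t_s=T_DA_S, f_osc_hz=F_OSC_HZ):
    """Number of oscillator clocks spanned by the transition."""
    return round(t_s * f_osc_hz)


def linear_ramp(start, stop, steps):
    """Integer ramp from start to stop in `steps` equal steps (assumed shape)."""
    return [start + (stop - start) * i // steps for i in range(steps + 1)]


print(transition_clocks())
print(linear_ramp(0x100, 0x000, 8))  # entering standby
print(linear_ramp(0x000, 0x100, 4))  # releasing standby
```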


[Graph: D/A converter input at AVO (scale 000H, 100H, 1FFH) across operating mode, D/A output transition time, and standby mode, beginning at the start of standby operation]

Fig. 3-5 D/A Converter Output when Entering Standby Mode

[Graph: D/A converter input at AVO (scale 000H, 100H, 1FFH) across standby mode, clock oscillation time, D/A output transition time, and operating mode, beginning at the standby release signal (start speech synthesis signal)]

Fig. 3-6 D/A Converter Output when Releasing Standby Mode
3.3.4 Pin settings in standby mode

The following cautions should be observed in regard to
the BUSY signal output and the message select code input
(I0-I7) in standby mode.

(1) BUSY pin
    Because the output from this pin becomes high
    impedance in standby mode, in implementations where
    this signal is checked the external controller must
    be configured to ensure that this signal is pulled up.

(2) I0-I7 pins
    If these pins are not set to high or low level in
    standby mode, there is a chance that excess current
    will drain. These pins should therefore be set either
    to high level or low level during standby mode. If
    these pins are connected to a circuit (such as a bus)
    with a floating state, they must be pulled up or down.


3.4 Start Speech Synthesis 
Speech synthesis begins when the signal input to the ST
pin goes low while the signal at the CS pin is already low.
Note that the operation performed when a pulse is applied
to the ST pin is somewhat different from that when it is
fixed at low level.


3.4.1 ST pulse input 
The timing when the signal input to the ST pin is a pulse
is shown in Fig. 3-8. The data on I0-I7 (message select
data) is latched at the rising edge of the signal input
to the ST pin. After this data is latched, there will be
no effect on the speech synthesis operation even if both
the CS and the ST signals are set to high level.
Since the speech synthesis start control circuit continues
to operate during standby mode, the start procedure is the
same in both standby and operating modes.


3.4.2 ST fixed input 
When the ST input is fixed at low level, the speech
synthesis operation is performed repeatedly (see Fig. 3-9).
In this state, the message select code (the contents of
I0-I7) is not latched, so this data should not be changed.
Changing this data may result in malfunction.


3.4.3 Standby mode 
Since the speech synthesis start control circuit
continues to operate during standby mode, the same
procedure as that described in 3.4.1 can be used to start
speech synthesis. In this case, the clock oscillation
begins when both the ST and CS pins become low level and
the BUSY signal is output after the clock oscillation has
become stable. Then, as described in section 3.3, the
speech output begins approximately 46.5ms later (see Fig.
3-10).


Table 3-2 Speech Synthesis Start Timing
(VDD = +2.7 to 5.5V, fOSC = 640kHz)

Parameter                  Symbol  Condition
BUSY output time           tSBS    Standby mode *
                                   (includes oscillation
                                   start time)
Speech output start time   tSSO    Operation mode   2.1  2.2  ms
                           tSSS    Standby mode     2.1  2.2  ms
D/A converter transition   tDA     Standby mode     46.5      ms
time

Note: Ceramic resonator Kyocera Corp. KBR-640B, C1=C2=150pF

[Circuit: ceramic resonator connected across X1 and X2, with capacitors C1 and C2]

Fig. 3-7 Oscillator




3.4.4 At reset or power up

If the RESET signal is input while speech is being
synthesized, or on power application, the data input to
the D/A converter is undefined but within the limits of 0
to 100H.

If speech synthesis is started under this condition with
both the ST and CS pins at low level, the input data
of the D/A converter is shifted to 100H and then the
speech is output.

Because the input to the D/A converter is shifted
gradually to 100H in the same manner as described in
section 3.3, the synthesized speech will be output a
maximum of 47ms after the RESET signal has been input or
the power has been applied.


[Timing chart: CS, ST (ST pulse width tCC), BUSY (BUSY output time), and AVO (speech output start time tSSO, followed by speech output)]

Fig. 3-8 Speech Synthesis Start Input
(When ST Input is Pulse)

[Timing chart: BUSY (BUSY output time) and AVO (speech output start time, followed by speech output)]

Fig. 3-9 Speech Synthesis Start Input
(When ST Input is Fixed Low Level)

[Timing chart: D/A converter input at AVO (scale 000H, 100H, 1FFH), ST pulse width, BUSY output time, D/A converter transition time tDA, speech output start time, and speech output]

Fig. 3-10 Speech Synthesis Start Operation
(Standby Mode)


3.5 Reset 
The RESET signal is used to initialize the LSI on power
application, terminate the synthesized speech output, or
release the standby mode.
The µPD7755/7756 is reset by holding the RESET pin at
low level for the following period of time.

• During operation        12 oscillator clocks (min.)
                          (18.75µs at 640kHz)
• On power application,   The same as above after
  in standby mode         clock oscillation is completed
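
The 18.75µs figure follows directly from the clock count and the oscillator frequency. The helper below is a small sketch for recomputing the minimum RESET low time if a different resonator frequency were used; that use is hypothetical, since the text only specifies 640kHz.

```python
def min_reset_low_us(clocks=12, f_osc_hz=640_000):
    """Minimum RESET low time in microseconds for a given clock count."""
    return clocks * 1_000_000 / f_osc_hz


# 12 clocks at 640kHz
print(min_reset_low_us())
```

On power application or in standby mode this time starts only after clock oscillation is completed, so a host should add the oscillation start time on top of this value.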


CHAPTER 4 INTERFACES 


4.1 Message Select Input 

4.1.1 Host control mode 
Fig. 4-1 shows an example when a µPD80C48 is used as the
host CPU. The message select code is output to the data
bus and this data is written to the µPD7755/7756 by
setting the CS and ST signals to low.
Because the output of the BUSY pin becomes high impedance
in standby mode, this signal must be pulled up. (In the
example in Fig. 4-1, because the ports of the µPD80C48
are provided with built-in pull-up resistors, the pull-up
resistor shown can be omitted.) Also, in standby mode, if
the bus to which I0-I7 is connected becomes high
impedance, there is a chance that excess current will
drain. In this case, these signals should always be
connected either to pull-up or pull-down resistors. Fig.
4-2 shows a flowchart when control is performed by the
host CPU. The circuit that checks the BUSY output in
standby mode must be provided with a wait equivalent to
the clock oscillation start time.
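
The host-side sequence described above (check BUSY, select the chip, place the message code on the bus, pulse ST) can be sketched as follows. Everything here is illustrative: `FakeSynth` is a stand-in written for this example so the sequence can run without hardware, and `say` is a hypothetical helper, not part of any NEC software.

```python
class FakeSynth:
    """Minimal stand-in for the LSI pins; all signals are 0 or 1."""

    def __init__(self):
        self.busy_n = 1      # BUSY is active low (pulled up when idle)
        self.cs_n = 1
        self.st_n = 1
        self.bus = 0
        self.latched = None  # last message select code latched
        self._pending = False

    def set_cs(self, level):
        self.cs_n = level

    def set_bus(self, value):
        self.bus = value & 0xFF

    def set_st(self, level):
        if self.st_n == 1 and level == 0 and self.cs_n == 0 and self.busy_n == 1:
            self.busy_n = 0          # start accepted, BUSY goes low
            self._pending = True
        elif self.st_n == 0 and level == 1 and self._pending:
            self.latched = self.bus  # I0-I7 latched at rising edge of ST
            self._pending = False
        self.st_n = level


def say(dev, code):
    """Start message `code`; returns False if the device is busy."""
    if dev.busy_n == 0:
        return False
    dev.set_cs(0)
    dev.set_bus(code)
    dev.set_st(0)
    dev.set_st(1)
    dev.set_cs(1)
    return True


dev = FakeSynth()
print(say(dev, 0x05), dev.latched)
print(say(dev, 0x06), dev.latched)  # refused while BUSY is low
```

On real hardware in standby mode, BUSY goes low only after clock oscillation has started, so a host that polls BUSY right after the ST pulse needs the wait mentioned above.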


4.1.2 Key input mode 
Fig. 4-3 shows an example of the µPD7755/7756 used
alone. If the ST input switch is fixed to ground, the
speech synthesis output will repeat indefinitely.
Unnecessary I0-I7 inputs should be connected to ground.


[Circuit: µPD80C48 data bus DB0-DB7 connected to I0-I7 of the µPD7755/µPD7756, with the WR output marked (*)]

Fig. 4-1 Circuit Example when Control is by Host

* Use 5V±10% for VDD. When a 3V line is used, the
  minimum pulse width for the µPD7755/7756 ST input
  becomes 2µs and it is no longer possible to use the WR
  output of the µPD80C48.


(1) Check the µPD7755/7756 BUSY output.
(2) Output low to CS.
(3) Set the message select data in the accumulator.
(4) Output the contents of the accumulator to the data
    bus. (WR is output with the output to the bus; WR
    functions as the start signal.)

Fig. 4-2 Flowchart (Control by Host)


[Circuit: message select switches on I0-I7 (47kΩ x 8), start switch on ST (47kΩ), CS, and RESET (47kΩ) of the µPD7755/µPD7756]

Fig. 4-3 Key Input Mode Circuit Example


4.2 Lowpass Filter 
Signals that have been digitally analyzed and synthesized
by a sampling method can only reproduce frequencies
up to one half the sampling frequency. Frequencies above
this range appear as unwanted noise, and if the D/A output
is directly amplified, the result is poor quality
sound with an extremely unfavorable S/N ratio. It is
therefore necessary to use a lowpass filter that
only passes signal frequencies of less than half the
sampling frequency.
A filter with sharp cutoff characteristics is desirable,
and ideally a filter with damping characteristics of
48dB/oct or better should be used. The complexity of such
a filter, however, is prohibitive and we have opted, in
the interest of simplicity and operability, for a filter
with characteristics of 24dB/oct for use with the
µPD7755/7756. Fig. 4-4 shows a µPC358C analog filter with
Butterworth damping characteristics (24dB/oct) that can
operate on a +5V single power supply.
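
The 24dB/oct figure can be checked against the standard magnitude response of a Butterworth lowpass filter, |H(f)|^2 = 1/(1 + (f/fc)^(2n)), with order n = 4. The sketch below uses an illustrative cutoff equal to half the sampling frequency; the cutoff of the actual circuit depends on the constants in Table 4-1 and is not assumed here.

```python
import math


def nyquist_hz(f_sample_hz):
    """Highest frequency that sampling at f_sample_hz can reproduce."""
    return f_sample_hz / 2.0


def butterworth_attenuation_db(f_hz, fc_hz, order=4):
    """Attenuation of an ideal Butterworth lowpass (4th order = 24dB/oct)."""
    return 10.0 * math.log10(1.0 + (f_hz / fc_hz) ** (2 * order))


fc = nyquist_hz(8_000)  # placeholder cutoff for an 8kHz sampling rate
print(round(butterworth_attenuation_db(fc, fc), 2))      # at the cutoff
print(round(butterworth_attenuation_db(2 * fc, fc), 2))  # one octave above
```

One octave above the cutoff the attenuation is about 24dB, compared with about 48dB for the 8th-order response the text calls ideal.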


[Circuit: AVO of the µPD7755/µPD7756 feeding a µPC358C filter stage (VCC, 5kΩ, 5kΩ, R4, R5) with output FILTER OUT]

Fig. 4-4 Lowpass Filter Circuit Example


When the µPC358C operates on +5V, input signals in a
range of 0 to 3.5V may be used. Resistors R1 and R2
should therefore be used to set the operating point to
2.5V.

Table 4-1 shows the appropriate constants when sampling
frequencies (fSAMPLE) of 4, 5, 6, and 8kHz are used.


Table 4-1 Lowpass Filter Constants 
(24dB/oct Butterworth Characteristics) 


(kHz) | (KQ) | (KQ) 
| 4 {2 | 
| & | 10 | 
| 6 | a2] on 


C1 0.022µF
C2 0.0022µF
C3 0.022µF
C4 0.01µF
RL 1kΩ


4.3 Power Amp 
Because the signal obtained as the output of the lowpass
filter described in the preceding section has a waveform
identical to that of a normal analog signal, this
output can be input to any type of power amp as required.
Fig. 4-5 shows an example of a power amp (µPC1212C) with
a low operating voltage that can be used with the +5V
single power supply of the µPD7755/7756.


Fig. 4-5 Power Amp (0.7W, 5V)


Because the µPC1212C can operate on voltages in the range
of 4.5V to 7.0V, if a 4Ω speaker is used with a 5.0V power
supply, a 0.7W (THD=10%) output is obtained.
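
For reference, the speaker voltage and current implied by these figures follow from P = V^2/R. This sketch treats the 0.7W figure as power into a purely resistive 4Ω load, which is a simplification at 10% THD.

```python
import math


def rms_voltage(power_w, load_ohm):
    """RMS voltage across a resistive load for a given power."""
    return math.sqrt(power_w * load_ohm)


def rms_current(power_w, load_ohm):
    """RMS current through a resistive load for a given power."""
    return math.sqrt(power_w / load_ohm)


print(round(rms_voltage(0.7, 4.0), 2), "V rms")
print(round(rms_current(0.7, 4.0), 2), "A rms")
```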


CHAPTER 5 SPEECH ANALYSIS 


The procedure for developing a ROM code for the
µPD7755/7756 is shown in the flowchart in Fig. 5-1. The
main points of the procedure are described below.


  Development decision
  -> Message script
  -> Message recording
  -> Original speech evaluation, editing
  -> Parameter selection
  -> Analysis order
  -> Analysis/coding
  -> Check
       OK: Mask ROM order (user)
       NG: Parameter adjustment (NEC)

Fig. 5-1 ROM Code Development Flowchart


5.1 Writing Message Script 
Because the frequency bandwidth of the reproduced speech
is narrower than that of natural speech, care should be
taken to write a script that avoids the use of ambiguous
expressions. Also, if messages that share common phrases
are used, ROM area can be used effectively.


5.2 Original Speech Recording 
Because the effective dynamic range for synthesized
speech is narrow, the speaker (announcer) should take
care to suppress inflections and to speak in a level,
even tone of voice.
Because the synthesizing process sometimes emphasizes
noise in the original voice recording, care should be
taken to keep all noise (particularly hum) to the
absolute minimum. For this reason, the use of a
professional recording studio using open reel tapes is
recommended. Also, to permit selection of the sample with
the best speech quality and the least noise, you are asked
to provide several recordings of each message.


5.3 Original Speech Evaluation and Editing 
Selection should be made from among the recorded messages
taking into consideration lack of noise, even tone of
voice, sound quality, etc. The messages thus selected
should be edited onto an open reel or cassette tape. When
an open reel tape is used, either the full tape or two
tracks may be used. For a good S/N ratio when using a
cassette, the use of metal tape is recommended. The
recording should be done at a somewhat high level, with
peak values in the range of +5 to +8dB. To ease the
analysis process, the format shown in Fig. 5-2 should be
used with the messages appropriately divided by blanks.


Message | Message | Blank        | Message | Blank        | Message | Blank
#7      | #2      | (3 to 5 sec) | #1      | (3 to 5 sec) | #0      | (15 to 30 sec)

[Labels in figure: tape direction, start of tape]

Fig. 5-2 Original Speech Tape Format for Analysis

When a cassette is used, only side A should be used.


5.4 Parameter Selection 
The parameters to be used must be determined before the
order for analysis is placed. For the µPD7755/7756, the
only parameter that must be selected by the user is the
sampling frequency.

Sampling frequency selection
Please specify 4, 5, 6, or 8kHz as the sampling frequency.
When 6kHz is specified, the frequency bandwidth is almost
identical to that of a telephone. When 4kHz is selected,
the bandwidth drops to only those frequencies under 2kHz
and there is some loss of clarity. Although no definitive
statement can be made about selection of the sampling
frequency, the guidelines laid out in Table 5-1 can be
used.

Table 5-1

Sampling   Frequency                                               Bit rate
frequency  bandwidth                                               (bps)
5kHz       up to 2.2kHz  Male voice/single words;   Slightly       10 to 20K
                         female voice/sentences     distorted
6kHz       up to 2.7kHz  Female voice/single words  Equivalent to  12 to 24K
                                                    telephone
                                                    conversation
8kHz       up to 3.5kHz                                            16 to 32K

Note that a great deal of noise in the message will cause
the bit rate to be high.
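
When planning a message list, the bit rate ranges in Table 5-1 can be used for a rough ROM budget. The sketch below is an estimate only: it assumes 1K = 1000 bits, a constant bit rate per message, and the 96K/256K ROM sizes from section 1.1; the message durations in the example are made up.

```python
ROM_BITS = {"uPD7755": 96_000, "uPD7756": 256_000}  # assumes 1K = 1000

# Bit rate ranges (bps) per sampling frequency, from Table 5-1.
BIT_RATE_RANGE = {5: (10_000, 20_000), 6: (12_000, 24_000), 8: (16_000, 32_000)}


def rom_bits_needed(durations_s, bit_rate_bps):
    """Total ROM bits for a list of message durations at one bit rate."""
    return sum(durations_s) * bit_rate_bps


def fits(durations_s, bit_rate_bps, device):
    """True if the estimated total fits in the device ROM."""
    return rom_bits_needed(durations_s, bit_rate_bps) <= ROM_BITS[device]


messages = [4, 4, 4]  # hypothetical message lengths in seconds
best, worst = BIT_RATE_RANGE[6]
print(rom_bits_needed(messages, best), fits(messages, best, "uPD7756"))
print(rom_bits_needed(messages, worst), fits(messages, worst, "uPD7756"))
```

Because noise raises the bit rate, a budget that only fits at the low end of the range leaves little margin.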


5.5 Ordering Speech Analysis 
Place your order for analysis after making the
preparations described above. Be sure to include all of
the following when placing your order:
• Original speech tape (with company name and date clearly
  indicated)
• Message list (message selection codes in the same order
  as messages are recorded on the tape)
• Parameter specification (sampling frequency)
All tapes should include the company name, section, and
name of person responsible.


5.6 Evaluation 
A cassette tape with a recording of the synthesized 
speech (result of analysis and coding) will be returned 
to the client along with the ROM code on an 8-inch floppy 
disk. 
If the result of evaluation is satisfactory, order the 
mask ROM specifications. 


If there are any problems, contact NEC. 